Detection of Topic Change in IRC Chat Logs

نویسندگان

  • Alan P. Schmidt
  • Trevor K. M. Stone
چکیده

We attack the problem of topic segmentation in the domain of Internet Relay Chat logs. In this process, we examine the previous work in text segmentation using a variety of methods. After considering the pros and cons of the methods, we employ Text Tiling, pause detection, and latent semantic analysis because they did not require the usage of large pre-tagged corpora. With these systems in place, we consider the properties and problems that exist when considering the domain of internet chat. To this end, we examine our results and show them to be fair at best.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Strangers in a Strange Land Interaction Management on Internet Relay Chat

This article examines a set of interactions (logs) takenfrom t h e f a n of computer-mediated communicntion known as Internet Relay Chat (IRC). The authors were particularly concerned with the interaction management strategies adopted by the participants in the logs during the opening and closing phases of the interactions to d m l o p interpersonal relationships and communicate socioemotional ...

متن کامل

(Dis)agreements in Iranians’ Internet Relay Chats

The present study on politeness is an attempt to examine (dis)agreeing strategies utilized by EFL learners while chatting on the internet. Subjects of the study were forty male and thirty-three female Iranian natives whose internet relay chat (IRC) interactions, composed of 400 excerpts, were collected between December 2007 and September 2008. Data analysis was based on the general taxonomy of ...

متن کامل

Concept drift detection in business process logs using deep learning

Process mining provides a bridge between process modeling and analysis on the one hand and data mining on the other hand. Process mining aims at discovering, monitoring, and improving real processes by extracting knowledge from event logs. However, as most business processes change over time (e.g. the effects of new legislation, seasonal effects and etc.), traditional process mining techniques ...

متن کامل

An Algorithm for Anomaly-based Botnet Detection

We present an anomaly-based algorithm for detecting IRC-based botnet meshes. The algorithm combines an IRC mesh detection component with a TCP scan detection heuristic called the TCP work weight. The IRC component produces two tuples, one for determining the IRC mesh based on IP channel names, and a sub-tuple which collects statistics (including the TCP work weight) on individual IRC hosts in c...

متن کامل

1 Play , Art and Ritual on Irc ( Internet Relay Chat )

one of the world's most popular online chat modes. 1 Usually, IRC participants communicate via typed words. In contrast, this group communicates in real time mainly via the display of brilliantly colored visual images created from letters and other typographic symbols on the computer keyboard. Participants gather in a channel (chat room) called #mirc_rainbow, or " rainbow " for short. 2 While a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003